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ABSTRACT 

An automated retrieval system for reference works was 
constructed as a prototype in order to illustrate the capabilities 
and potentialities of future retrieval systems. More than 170 
biographical works were indexed and stored in a computer disk to form 
a biographical data base, and a computer program was written to match 
patron queries. Consistent with the indexing^ the system retrieved 
appropriate reference sources to biographical reference problems. 
(Author/SK) 
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ABSTRACT 

The problem was to construot an automated retrieval system for 
biographical reference xx>rks. This retrieval system, although limited 
in scope uas to serve as a prototype of future retrieval systems and 
to illustrate the capabilities and potentialities of computerized 
reference retrieval systems. Over 170 biographical reference works 
were indexed in terms of ;hree categories; namely, (l) miscellaneous, 
(2) occupation, and (3) time period* These works were then stored on 
a comuuter disk; thus establishing a biographical data base* A computer 
program was then tn^itten to match patron queries with- this data base* 
Consistent with the indexing^ the system retrieved appropriate reference 
sources to biographical reference problems* 
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CHAPTER I 
THE HIQBZSK 

To most librarians it is very evident there is a tremendous 
volime and variety of reference sources available in today's modem 
research libraries* Bibliographic control of these reference vorka 
is difficult to maintain in a manner simple and understandable to 
the novice library user* Because of this^ patrons are constantly 
confronted by the problem of vhioh reference book to search to find 
information about various reference problems, Hence^ the patron 
generally stumbles around on his o-m, or asks a. reference librarian* 
The librarians in tum» attempts to recall^ either by memory, personal 
notes, or reference books, ^ioh type of reference works are most 
appropriate to the patron *s given problem* Sometimes, despite his 
training* and experience, even the librarian is unable to suggest 
accurately and speedily all the possible type of sources the patron 
may search to find an acceptable ansver* 

Hhe tremendous speed and memory capabilities of a computerized 
retrieval system can render assistance in suggesting various types of 
tools the patron may want* IdealHy, all the reference books (of 
a given library, for example) could be indeaeed and stored on a rapid 
access device so "tiiat a patron's question could then be fed into the 
system* Quickly and accurately, the computer would print out all the 
possible reference sources from which a piatron could find an acceptable 



answer* This project discussed such an automated system*. 

Statement of the Problem- . 
Briefly stated^ the problem was to construct an automated 
retrieval ^stem for biographical reference works* The paper was 
partitioned into two different parts* The first part of the paper, the 
literature search, concerned itself ^dth the following questions What 
is involved in implement3jig an automated reference system? In other 
•words, ?eneral3iy speaking, what problems occur in the implementation 
of any computerized reference system? 

• The second part of this paper concerned itself T^dth the construction 
of an automated reference search system based upon the elements studied 
in the first part of the paper* This system was not all things to all 
people* On the contrazy, the system was extremely limited, both in 
purpose and scope* The main purpose of the system ma to iUastrate 
the capabilities and potentialities of computerized reference search 
systems* The system's purpose was not to serve as a perfected retrieval 
system ready to use at the patron level, although it certainly could, 
but instead to sez^e as a prototype retrieval system that could be 
expanded and further refined for practical use at the reference desk* 

Delimitation of the ProblCT 
The system was limited in scope* TbQ data base was only composed 
of biographical reference vorks regarding persons ^Aio live within, or 
t^o have lived within, the United States from Colonial times to the 
present* Furthemore> it only included works presently held by the 
Harold B* Zee Library General Reference Department and shelved on the 
main (third) floor of that Olbrazy* All other biographical vorks oil 
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other floors viere not included. Although this retrieval system, was 
"on line" and fully automatic^ the assistance of a qualified person was 
still necessary to formulate and enter the patron's search profile inta 
the system* Hox^ever^ it trould be possible to augment the systen with 
comter-assisted instruction^ thus enabling the system to faction with 
a miniimim of supervisory assistance* 

The study did not discuss computer programming techniques or file 
organization to fjreat length. Moreover, the study did not deal with 
philosophical or theoretical concepts of reference service, nor with 
philosophical concepts of information theory, infoiration science, or 
computer-assisted Instruction systems or techniques* 



CHAPm u 

• • 

LITERATUBS SEARCH 
Attempts to automate library processes^ which, began seriously after 
World War H, have focused mainly on technical services including 
cataloging, acquisition, circulation control and serials* Little has 
been done in the area of reference work to utilize the capabilities of 
the computer. Although the reference librarian has seldom directly 
benefited .from automation, he has benefited indirectly. For example, 
in some libraries the holdings are on magnetic tape or in "machine 
readable form" -sMch makes it possible to produce both massive and 
selected bibliographies on given subjects. ♦^Key word in context" as 
well as "citation^* indexes are computer produced and established to 
provide bibliographical control more effectively for the reference 
librarian* 

Except for the two studies discussed in this paper, there are vexT 
few instances of functional automated systems which give direct and 
immediate help to reference queries^ There have been some atteopts to 
streamline bibliographic control of reference works using sard systems* 
These include the well known "punched card," "peek'^-boo," "edge-notched>* 
and "Uniterm" systems.*^ Each of these attempts to act as a means by 
which sources are categorized and subdivided on a card and then visually 

^Claire K. Schultz, "Automation of Reference Work," library 
Trends 12 (Januaiy 1961) s Ia3-llt# 
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examined by the librarian to detemine appropriate sources for a 
particular need* Althouf;h they offer some help> they are not very- 
practical or efficient^ nor do they use the speed and memory qualities 
of a computer* 

Tm research pro,ject8y described in the literature> report 
experimentation with automated reference search systems. These projects 
trere conducted in the last five yearss one at the Uhiversity of Chicago 
and the other at the University of California at Berkeley* Both of 
these projects attempted actually to automate a reference system* 
Automation, in the context of these two experiments^ means the indexing 
of reference vrorks. with descriptors, the feeding of this information 
onto a rapid access storage device, and the oreranizing- of a data bank 
for retrieval purposes • To date, these have been the only two major 
projects dealing with comptiterized reference services* Ifach of the 
rema5jider of the iiteraiy search portion of this paper was devoted to 
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an examination of these two systems, giving a good idea of the state of 
the art of automated reference systems* 

Examination of Weil's System 

The first system examined ima the one established by Kiss Cherie 
B* W^il at the Graduate School of Library Science at the University 
of Chicago in 1967* Miss Vfeil attempted to establish an autonated 
biographical reference search system that would advise the patron i^ch 
types of biographies would be most appropriate to check in terms of a 
given biographical ref erentie question* 

tfeil assumed that biographical reference works (which are confined 
to information about people) have something in common. She felt that 



inost biographical vorks generally give ths same type of information for 
each person*^ This assumption further led her to assume that it was 
possible to index biof^phies ^thin a computer in such a way as to 
establish an automated retrieval system* ^chanioal retrieval could be 
achieved by matching the index teims assigned to the reference ttorka 
•with the patron's request through an appropriate retrieval rule* \ifeil 
felt that an advantaf;:e of such a system would be to free the librarian 
80 that he could engage in less mechanical and more scholarly tastks* 

vreil*s system dealt with all types of biographical reference works 
£rom all types of occupations and nationalities* These biographie&i were 
indexed in terns of a unique classification scheme* She used nine 
different index descriptors to describe each biographical work. They aret* 

(1) Living (Are the subjects considered in the book living or dead?) 

(2) Nationality (Does the work include only certain nationalities?) 

(3) Occupation (Is the work- limited to only certain occupations?) 
(It) Minority Groups (Is it limited to specif ic minority groups?) 
(^) Date (t'lhat time period does the work cover?) 

(6) Indexes (t^liat separate indexes^ if any» index the work?) 

(7) Sex (Is the book limited to any one sex?) 

(8^ Specifics 1 (Other information that describes the work*)2 
(9) Specifics 2 (Other information that describes the work*) 

The following is an example of a biographical reference work that is 

completely eodeds 

DICTramiO/ INDEX FIEID X, LIVING N X, OCCUP Z, SEX A, HAT PHILP 
ASIAN X, SPEC 1 D C DS EL BP L CL (3M DG E I Z, DATA SO's X, SPEC 2 
P PL R MS PD 2, MIN Z^J 

Interpreted^ the above entry means s Dictionary of Hiillipplne Biography; 



Cherle B. Weil, "Automatic Betrieval of Biographical Reference 
Books," Journal of Library Automation 1 (December 1968) « 22;0* 

2 

Ibid*^ p* 2ltl» 
3lbid., p* 2l*?r 



a book lijnited to dead Filipinos and Riving for each entry,, the dates, 
earner, descendants, field, birthplace, lonp articles, class in college, 
degrees, education, pictures, parents, publications, references, marital 
status, and physical description* Each refer«dnce ^Kork \ms indexed in 
terms of the nine divisions, since a fixed fozmat -ms used in the file 
structure* Ihis meant that a certain amount of space ms allotted each 
index category for each work indexed* 

vreil indexed, in the above manner, maz^ different biographical 
reference works* The data were stcred on tape, disk or cards, which 
would then be accessible by computer, thus establishii^ig <i sizeable 
computer data bank of biographical reference works* This data bank was 
then used to benefit both the librarian and the patron directly* For 
example, if a patron i^nted a biography of a Dutch economist^ that question 
could be fed into the machine and those works ^ose index records matched 
the request would be retrieved* 

The reader may ask the ouestiont ^^How vsfs the machine able to 

interpret the patron's request, for a bio^aphy of a Dutch economist?'' 

This reouest was coded in the same manner as. the biogra^ical reference 

works were coded, so that the request for a Dutch economist would look 

like the foUowingr 

Qiiestion: SEX Z (or id, UVIWG T, NAT DUTCH, OCXJUP BOON, MIN Z, 
PATE 60' s, Si=EC 2 PL*-*^ 

It -Kss especially important that the patron's search profile be coded 

using exactly the same descriptors with ^^ch the vjorks were indexed* 

Furthermore, the search, profile must be constructed in the same order 

in which the wozics were coded* This means, for example, that the 

^id., p. 2h3t 

13 ■ 
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category «»SEX»» should always be first in the search profile, the 
category "LIVING or DEAD" should always be second, the category 
"MTIOHALITT' whould follow third, and so forth. In other words, for 
a aiatch to occur the search profile must be in exactly the same order 
as the works were indexed, 

As can be seen from the above search profile, the machine is 
programmed to compare the coded abbreviations of the patron* s request 
with the coded abbreviations of the biographical reference works within 
the data bank. The computer simply prints out the results of the 
match. This is basically how a^y automated reference system works. 

Examination of Meredith's System 
Following Weil's research in 1967 was another research project 
started in 1970 by Mr, Joseph C, Meredith at the Institute of Library 
Besearch at the University of California at Berkeley* Zferedith's* 
research was funded by the Office of Education, Mf?, Msredith had a larger 
staff, more funds, and was more sophisticated in establishing a larger 
data bank, but implemented some of the same elements as Weil, Hhe two 
projects mre similar In sowo respects, but in others, differed greatly. 

Meredith's system initially began as an attempt to construct a 
eonputer-^ssisted instruction systeza, not a reference search system. 
However, in the process of the- former, the latter developed. His 
BEFSEAHCH system, like Weil's, retrieved reference sources, not direct 
answers to the patron's question* As I4eredith put it, "It retrieves sure 
sources of information which are likely sources of specific infonnation»"^ 

^Joseph G* Meredith, Reference Search System (REFSEARCH) Users' 
Manual , (Berkeley: Institute of Library Research, University of 
California, Berkeley, 1971)^ p» !• 

. 14 ' 



His entlro collection vas divided into approximately sixteen different 
ma.ior divisions* In turn, each division was then subdivided by five 
different sets of subdivisions. Ultimately, the limitations of the 
system's usefulness depended on the exactness of the classification 
scheme* Suffice it to say thnt Meredith established a means by iihich 
he could identify and describe his reference collection* 

As mentioned above, Meredith's HEFSEARCH system vas much more 
comprehensive and sophisticated than Weil's* Meredith's staff spent 
about ten and one-half man«4iionths of labor before the entire coding or 
indexing of his 790 different reference works vas completely finished* 

Not only ms Mieredith's system more extensive, it differed from 
Weil's in that the former assigned numerical codes to. each of his 
descriptors* For example, in Meredith's system, the descriptor 
"LOCATES" could be represented by the numerical code 3h$$ or the 
descriptor "HlOPER-KAMHn)" could be represented by the numerical code 
33h and so on* The reason for this is that althou^ the compu ter can 
handle and match series of letters, it is more efficient, speedy^ and 
economical vhen it deals idth numbers* Furthermore, vhen alphabetical 
symbols are used, then an '^ASSEMBLER" must be inritten to translate the 
characters to numbers* To communicate in numbers is less convenient to 
the user, but more efficient for the computer* 

In Meredith's system, patron queries are submitted as a series 
of numbers separated by commas* An exainple of this vill help to clarify 
how his system vorked* Suppose a patron came to the reference desk with 
the following question: "I've heard that Kurt Godel, the mathematician* 
lives somewhere in the United States* Where can I find his home address?'* 
The librarian then studies the question, determines what kinds of 
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infoxmation are being dealt vith and vhart kind of service is needed^ 

and then proceeds to express the question in terms of the aforesaid 

descriptors* Hence, in teiros of Meredith's descriptors, the patron's 

question might look like the following: 

Required: A work which LOCATES (3li?) HEAL (339) PR0PI1R-M4MED 
(33li) INDIViriUAL (335) LIVING C3lil) PERSONS (333), Mast cover 
MATHEM»ViICIAWS (528) in the UNITED STATES (558).-^ 

The librarian would then type the following nunibers into the computer: 

3)69 339, 331) 33$9 311, 528, 558. The computer conqsares the set of 

numbers with the numbers within its data bank, makes a match, and then 

prints out the works that matched, Esccept that Meredith's system used 

coc'ed numbers to describe reference works, while Weil's used words to 

describe reference works, the two systems employ the same basic elements 

and procedures to implement the system* 

Summary and Conclusions of Literature Seareh 
The foregoing pages have attempted to analyze and describe the 
literature which describe what was involved in io^lementing a reference 
search system. Ifee literature search has especially centered on two 
research projects that explicitly dealt with the iji^lementation elements 
involved in a reference search system* The first project was Cherie 
Weil»s and the second was Joseph Meredith's* Although the two projects 
were independent of each other, some of the same elements and procedures 
were followed to implement both systems. Biese two projects were the only 
ones in -the literature that have seriously atteopted to automate 
reference services* 

^Joseph C. Meredith, "Machine-Assisted Approach to General 
Reference Materials," Journal of the American Society for Information 
Science 22 (May-June l9a)i 102. 
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Although Vfeil and Meredith actually constructed a computerized 
system that worked, neither of them used it at the reference desk 
level, so the library patron did not directly benefit from it* Both 
systems were used as an experimental prototype from vhich future systems 
could conceivably be built and used at the public service desk* 

Both systems used a unique classification scheme to indesc the 
reference works; however^ neither Weil nor Meredith claimed these 
schemes were fool proof* On the contrary^ they both admitted the 
system was no better than their application of the index terms to their 
reference collections* . iideed, Meredijbh admitted that his descriptors 
were not consistently applied throughout his collection* Meredith also 
said that any classification scheme is to a certain extent artificial in 
its attempts to identify and distinguish one reference work from another* 

Another criticism of Weil and Meredith was that initially neither 
system was capable of ranking the reference sources » from which 'a patron 
could find an answ^ to his quest^n, in order of jjiportance* For 
exantple, suppose a patron asked the questions "Who is Jesse Shera?'^ 
'Rie system might list two reference sources from which an answer could 
be found, namely; vnio»s Vflio in America and International Who's Who* 
As far as the patron is concerned, both sources equally possess the 
s&aie amount and kind of information on Shera, when in reality the 
foraer. Who's Who in America, would probably contain more information 
on him* 

One of the remarks'sle qualities of Meredith's and Weil's systems 
was the fact that they both harnessed the speed, memory, and efficiency 
capabilities of the computer* This machine can. very quickly conpare 
any combination of tems, arrive at a *>match>** and then print out the 
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uorks that deal with the combinations of topics desired. 7his is an 
advantage over card catalog systems vhere only one topic at a tijna 
can be dealt Kith efficiently* In terms of speed and accuracy^, 
although never officially and unequivocally proven, the computer 
certainly has the potential for superiority over the human librarian, 
since it never forgets any of the hundreds of details associated iiith 
each of the hundreds of reference books* 

Both Weil and Keredith followed a definite procedural pattern in 
establishing their systems* Certain sequential steps had to be 
followed in order to make their systems work. The most valuable aspect 
of the literature search portion of this paper was in studying the 
procedural steps taken both by Weil and Meredith. These steps were 
followed by the autiior as he attempted to establish an automated * 
reference search system* These steps were outlined in the methodology 
statement within the paper*- 

. :ihe literature search chapter yielded some excellent, scholarly, 
in-depth information on certain research projects conducted on the 
subject of automated retrieval systems* These projects, although 
imperfect, provided excellent information on the state of the art and on 
the implementation of a computerized reference system* These projects 
pointed the way into automation of public services, which is far behind 
automation of technical services* The projects are admittedly 
experimental, to a certain extent unrefined, and untried in terms of 
actual patron use, but they have given the author a basic formulation 
and understanding of computerized retrieval systems*. 
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CHAPTER m 

t 

! BUILDING THE RETRIEVAL SYSTEM 

Methodology 

An aiitoinated reference s^tstem tjas created by following basically 
the same procedural steps as Weil and Meredith* These steps foUowt 

(1) The system's ndesign** limitations were established. These 
limitations prevented the system from "grovingx too large because of 
the nature of file organization^ memory size^ and the like. Basically 
the exact types of works and how many of them are to be dealt with in 
the system were decided during the preliminary stages. This wa» 
important since these Initial boundaries limited the kinds of requests 
which were acceptable to the system and determined the areas with which 
the computer dealt* The system contained only biographical reference 
wozics that had information about persons who are living or have lived 
in the United States « including Alaska and Hawaii* from Colonial times 
to the present in all areas of activity. The ^tem was limited to 
only those biographical works housed or shelved on the third (main) 
floor General Reference area of the Harold B. Lee Library. All other 
biographical works on other floors were not included* 

(2) After the basic limitations were set, the system's scope was 
further limited by devising descriptive surrogates for each biographical 
reference source. The definite level of specificity and depth to which 
the iturrogates or descriptors would be applied was also established* 

In other words, a unique descriptive classification scheme for indexing 

13 
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the vorks was prepared* This step was most important^ since this ' 
classification scheme determined the extent to uhich the system was 
able to retrieve sources effectively. Ihe author indexed the ' 
biographical reference works in terms of (1) a miscellaneous category 
set^ which included sex^ regional location within the United States, 
religion, race, nationality within the United States, and age, (2) 
occupation categories and (3) time period categories* All works were 
indexed in terms of these three areas* 

(3) An authority list of the descriptors or index terms was 
compiled for use in the classification scheme* This list provided 
a means of standardization when indexing the works and also provided 
the base into which all requests were converted* . 

(U) The works were indexed in terms of the clasrification scheme 
by using the descriptors or index terms to describe and identify the 
works. Care was taken to apply accurately and consistently l^es'e terms 
throughout the collection of works* 

($) The information was stored on computer disk* This data base 
was developed by directly typing in the information through the computer 
terminal. Inf ozmation included all the bibliographic data in the works 
themselves (such as title, call number and explanatory notes) as weU. 
as their respective descriptors* The information may be stored not oxily 
on disk, but on any machine-readable medium such as cards, magnetic 
tape, paper tape, drums, etc. Moreover, it is not mandatory that the 
data be encoded lay using only the terminal. For example, the data 
could be keypunched onto cards and the cards then read into the computer* 
The key point is that bibliographic infonnation mast be converted to 
machine-readable form* 
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(6) The software vas created, which instructed the computer how 
to natch the patron's question «ith its data bank of biographical 
works* The author chose to -urite the program in COBOL language under 
the direction of Hr* Bruce Smith, Mr. Don Novak and Mr* Lany Oibson> 
all of whom are recent graduates of the computer science department 

at Brigham Young Universi-ty* Although the COBOL language was used and is 
vexy efficient in dealing with strings of characters, it is not the only- 
possible language that could have been used for this project* 

(7) TSae patron's questions were expressed in terms of the unique 
descriptor terminology* This enabled the computer, by using standardized 
index terms, to match the psitron's question with its extensive data bank 
of biographical information* The librarian must convert the patron's 
natural language question to a logical descriptor language question bgr 
using the list of index terms as developed in step three*. 

£^ carefully following the above methodology, the author established 
an experim^tal computerized reference system tiiat accurately retrieved 
sources of information* This system can be used to explore potential, 
improvements to user service at the reference desk* 

The remaining portion of this project applied each of the steps 
listed in the methodology statement to the building of the retrieval 
system* Since this project focused around building ah automated 
retrieval system, and since the methodology specifically dealt with 
that, the remainder of the project followed the methodology statement 
as a sequential outline* 

The author chose to deal with only United States biographical 
reference works that were shelved in the general reference arda of the 
third floor of the lihraiy* Over 170 such titles, consisting of many 
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more volumes^ were shelved in the general reference area, The author • 
chose to deal with biographical reference works for several reasonst 
firsts there was no exhaustive index that provided bibliographic access 
to the thousands of names listed in the hundreds of works; secondly, 
there were only a few reference tools, such as Winchell's Guide to 
Reference Works and Slocums' Biographical Dictionaries, which categorized 
biographies by occupation. These latter works provided a service similar 
to the computerized retrieval system, but the retrieval system had other 
advantages; thirdly, biographical works contain basically the same type 
of infozroation, are arranged in similar formats, can be indexed in terms 
of a general classification scheme, and hence, lend themselves to this 
soirt of project* 

Perhaps the most important step in this entire project was to 
devise an extensive, unique classification scheme which was appropriate 
to biographical reference wozics and which was both general as well as 
somewhat specific in nature* nQensral'* meant, very broad, widespread 
categories such as **scientist" or "businessman*'* "Somewhat specific* 
did not mean explicitly precise, but ref ered to an intermediate level 
of specificity, such as the terms "chemist" or "accountant*" Explicit 
precision, in the classification scheme, was unnecessary, since most 
biographical reference works only reached the intermediate level of 
specificity* Therefore, unless stated otherwise, the word "specific" 
did not refer to explicitly precise but only to an intermediate level 
of specificity* 

An explanation of the importance of an accurate classification 
seheae was necessary* A document was Indexed, being given the 
descriptors, "evaluation," "library schools," and "curriculum*" 

» 
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The implication vas tho document dealt vlth the evaluation of library 
school ciurrlculum> but later the document was actually found to deal 
vith the evaluation of elementaxy school libraxy programs* In this 
ease> the assirned descriptors did not accurately describe and identify 
the subject of the document. A user vould find it both disappointing 
and annoying; to retrieve such a document only to find it unrelated to 
his desired topic* Thus, any classification scheme must contain 
descriptors -which are precise in meaning and succinctly descriptive* 
These descriptors must also be applied \dth equal consistency over the 
entire data base, as discussed later in this paper* 

An analysis of the essential areas of infonnatlon to be described 
in biographical rersrence vorks was necessary before a classification 
scheme could be devised accurately* The analysi::; fo\ind essentially 
three special areas which should be emphasized in biographical worlcs 
to describe them accurately* The three areas werei (1) Hlsoellaneous 
categories, which included regional location within the United States^ 
sex, religion, race, nationality within the United States, and age| 
(2) occupation categories, and (3) time period categories* These 
three areas adequately and yet simply described most biographical 
reference works which contain infonnatlon about people within the 
United States both from Colonial times to the present and in all fields 
of activity* 

TtiXs scheme is outlined on pares eighteen, nineteen and twenty of 
this paper* Pares nineteen and tiienty illustrate the various occupation 
areas and their respective symbols* There are eighteen areas which are 
subdivided into other occupation areas • The large majority of 
blographlcfil reference works can be properly and accurately indexed, 

• 
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Desc riptor Classification Schema 
Field One- Miscellaneous Categories 
Chart no* X - 

A §gC 

B mis 
c FEms 

D UNITED STATES 

£ EAST (includes Middle Atlantic and Northeastern States) 
F SOUTH (includes Southern and South Central States) 

0 ' MIDVIEST (includes Central and Midwestarn States) 
H WEST (includes California 'and the Western States) 

1 NORTHV/EST (includes the States of Oregon and Washington) 
J ALASKA 

K HAWAII 



L RELIGI(»I 





MORMON 


N 


CATHOLIC 


0 


PROTESTANT 


P 


JUDAISM 


Q 


OTHERS 


RA^ 






NEGRO 


T 


INDIAN 


U 


ORIENTAL 


V 


OTHERS 



W NATIONALITY 

"T gJSSFEAN 

Y SCANDINAVIAN 

Z ASUN 

0 AUSTRALUN 

1 ISLAliDS 

2 NORTH AMERICAN (includes Mexico, South America and Canada) 
AGE ... 

3 0 to 16 years old 
U 19 to 35 years old 
5 36 to HO years old 

NOTEj The Classification Scheme will accomodate growth of new sub- 
divisions, simply ty adiing new alpha-mmeric symbols. Pbr example, 
the religious category, "Buddhism" could be represented the 
symbol ''Pi% or the occupation category "Management", by the symbol. 

Classification Scheme can be further expanded 
and subdivided if necessary, simply by inserting these symbols in tiie 
appropriate position within the sehene. jr wo 
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Descriptor Classification Schema 
field Two - Occupation Categories 



.•r Chart no» 1 

+ ALL OCUPATIONS 

A ARTISTS 

B PAINTim 
B £Nr.RAVINQ 
B SCJLPTUBE 

C MUSIC 
C THEATRE 

C MOTION PICTURES 

G RADIO and TELEVISION 

C ENTEJITAINMENT 

D ARCHITECTS 

E ATHLETICS ' 

"B~SPSS!rS AND. GAMES 

P AUIHORS * 
G POETS 
G NOVELISTS 
G PLAYWRIGHT 
G JOURNALISTS 

H CHIIDREN WRITERS 

I BUSINESSMEN 

"J PHILANTHROPISTS 
J BANKING AND FINANCE 
• J lOTSTRY 

K ACCOINTING 

K STATISTICS 

K ECONOMICS 

K ADVERTISING- 

K PUBLIC RELATIONS 

L EDUCATORS (inclAides all levels and all disciplines) 
H LIBRARIANS 

N PROFESSORS (includes all disciplines) 
0 ELEMENTARY AKD SECONDARY EDUCATORS ' 

P ENGINEERING AND TECHNOLOGY 
Q MATHEMATICS • 

Q COMPUTER SCIENCE AND DATA PROCESSING 

R AVIATION 

S ATOMIC SCIENCE 
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Descriptor Classification Schemo 



Field Two - Occupation Categories (con^t) 

T EXPIORERS Chart no. 1 

U ^HUMANITARIANS (social and economic reformers) 

V HUMANITIES 

ENGLISH - See EDUCATOKS 
HISTORY - See EDUCATORS 
IITERAITJRE - See AUTHORS 
( PHILOSOPIiY - See Also EDUCATORS 
W LINGUISTICS - See Also EDUCATORS 



X INVENTORS 
Y IM. 

Z MILITARY SCIEMCS 
P ARMY 
\ NAVY 
2 AIR FORCE 



3 MEDICINE 

"Ti FliJSlCIANS AND SURGEONS 
k NURSES 
5 DENTISTS 

6 PHYSICAL Am BIOLOGICAL SCIEMCfS 

7 ECOLOGY 

7 BOTANY 

7 AGRICULTDHB 

7 AGRONOMY 

8 ZOOLOGY 
8 BIOLOGY 

8 ANIMAL SCIENCE 

9 CHEMISTRY 
J PHYSICS 

C2I0r.0GY 
FOOD SCIENCE 



SOCIAL AI^ BEHAVIORAL SCIENCE 
J SOCIOLOGY 
« POLITICAL SCIENCE 
« STATESMEN 

) PRESIDENTS OF THE UNITED SIATES 
? ANTHROPOLOGISTS 
«» ARCHEOLOOISTS 
$ GENEALOGISTS 
% CSIOQRAPHY 
- PSYCHOLOGY 



- STUDEWTS (includes secondazy and higher education) 
# THEOLOGIANS 

20 
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as far as occupation is concerned, Tsy this set of occupations.. 

Pa^re eighteen illustrates a variety of index, areas, including 
sex, regional location ^jithin the United States, religion, race, 
nationality idthin the United States and age* As the author indexed 
the 170 various works, he found it necessary to index them not only 
in terms of occupation and time period, but in the areas of regional 
location, sex, religion, race, nationality and age as well. For exainple, 
how does one accurately describe such biographical works as, Negro 
Ninistersa ^lo's mo of American Women. Who <s Who of Polish Amerioa. 
Who»s mio of Scandinavian America > Who's ;^o in the Catholic Church or 
Wio^s \'1ho Amonfir High School Students? If the index areas of occupation 
are used along with time period, the above mentioned works could not 
be accuratel>7 described, and hence, they could not be retrieted 
properly. In other wrds, these works need more than Just the two 
index areas of occupation and time period "to describe then accurately. 
The index areas, as illastrated on page eighteen supply this need* 
Because of the limitations of the computer program, the areas 
as outlined on page eighteen could not be intermixed within themselves, 
althou^ they could be intermixed with the occupation areas as outlined 
on pages nineteen and twenty. For example ,v the areas such as religion, 
race and age cannot be mixed together with scientists; only religion 
and scientists, or race and scientists, or age and scientists can be 
mixed together to describe a reference worki This means that a work 
entitled. Catholic Neryo Scientists can only be indexed as Kegro- 
scientists or Catholic-scientists but not Catholic-Negro scientists. 
The work can be indexed in such a way, however, that if a patron 
searched for either Catholic-scientists or Negro-scientists he would 
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retrieve the vork Catholle-Wepro Scientists > In other words, the 
vork is indexed under both index tcnns, but not both at the sane time*. 
This limitation is not serious in terms of patron satisfaction, but 

« 

it does mean the user will, under some situations, have to i>a6s twice 
over the data base to be assured of getting all the works possible* 

A few other e:{planations need to be made concerning the descriptors 
on pare eighteen* The heading SEX is used to describe such works as 
X'^o*g !'?ho of American Women , ^ere the work deals with one sex 
exclusively. The heading TTN7T1SD STATES was divided into regional areas* 
The EAST, for example, denoted all states lying within the area designated 
as the H5ddle Atlantic and Northeastern Sta^ 's* A regional index of 
terms was needed to describe such works as T']ho*s Itlho in the East and 
^■^0*8 T'Jho in the Vest * Other works such as T'Tho's T'Jho in California 
and !#io»s T/Jho in Oregon needed to be indexed with a regional location 
descriptor* The latter two were indexed with the descriptor WEST, since 
that is as specific as the index terms will alXowv 

The heading RELIGION was subdivided into very general denominations, 
since there are vezy few biographical reference vorks which are more 
specific in format* This is al^o true for regional location biographical 
woxics. There are not many states, for example, which publish their 
own biographical works* In other words, there is no need to particu- 
larise a classification scheme with numerous specific index tezms 
which are very rarely used* It is better to have ^at could be called 
"intermediate** index terns which lie between the very general ones 
and the vexy specific ones* 

Another heading on pane eighteen is BACE* !Ihis heading was 
divided into general, but specific-enough, sub-index terms* The heading 
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IttTIOWALITy is similarly divided into areas of the world. A reference 
work entitled, ^-Jho's Vffio in Polish America , received the indesr term 
EUROPEAN, while the work entitled, ^^^o»s ^>Jho Amonff Japane^se aner ieana. 
was indexed by the term ASIAN and so on. The last heading on pa/?© 
twenty is ACK. The "0 to 18»» category is used to index a work such 
as T'?ho»5 !^Jho Ainonff Hi^h School Students > 

There are eighteen ina.1or occupational categories, each one 
subdivided into various areas. The ALL OCCUPATIOIIS categoiy included 
all the other seventeen categories. Ttie BUSIIffiSSMEN category was 
divided into eight subdivisions. The author has not tried to be all 
inclusive with regard to these subdivisions, but has only attexnpted to 
divide the main headings into «»intermediate» subdivisions so that the 
reader may know generally what occupational categories come under which 
headings. As mentioned earlier, there is no need to be very specific 
since most biographical works tend to be general or "intermediate** in 
scope* 

The time period category was not outlined in the "^scriptor 
Classification Scheme" because no set tijne periods were used. If 
work dealt with people living from 1901 to 196$, those years were used 
to describe the vrork. If a work was published in 1936 but was retrospec- 
tive and covered persons living from Colonial times up to 1936, then the 
dates 1600 to 1936 would be used. Most works usually state what time 
period they cover and in these cases it is veiy simple to affix a set 
of beginning and ending dates* However, there are a considerable 
number of works that do not plainly specify the time period covered^ 
A work entitled, !'?ho»s >Tho In the Rockies, for example, was published 
only once, in 1923 • It was not retrospective; that is, it only dealt 
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w^th persons living at the t5me it vas published* Therefore* the 
beginning and ending dates 1923 to 1923 vere aaatmed to it* 

To assign only one year to a biographical reference vork seemed 
too limitinfr. For example* if a patron wanted a work on people in the 
Rockies in 192?, then V^n's T«Jho In the Bocklps would not be retrieved . 
since the date 1922 does not natch the date 1923* At first examination 
this seeias a distinct disadvantage to the user* If he does not \miit 
the precise date underwhich the work is indexed* than there will not be 
a match ^nd the work will not be retrieved. Therefore* the computer 
program was written so that the user may or may not include a date in 
his search* llhis meant that a patron may use only the first t«o groups 
of index areas (miscellaneous and occupation categories) in his search 
expression without including the time period category* 

It would certainly be possible* although it is not presently built 
into the system* to alter the computer prograzQ so that the user could 
Include in his search profile the expression <1^23* which means less 
than or equal to 1933* !Iliis expressiion would ensure that the user 
could retrieve all the relevant works within the data base that were 
equal to or less than the year 1923* Honsver* since this option is 
sot possible within the present system* the user will have to be 
satisfied with the "no date** search routine as explained above* itrliich 
is almost as satisfactory as the "equal to or less than the date*^ 
routine* Ifore vill be said later on about retrieval rules and patron 
ftearch profiles* 

On pages eighteen* nineteen and twenty* there were certain symbols 
assigned to each descriptor*. The symbol A* for example* on page 
rd ghteen means SEX; the symbol S means UEQRO* and so forth* Both 
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the main headJnr'g as well as the subrdescriptors have symbols. The 
bioffraphical reference works, within the data base, were indexed in tenns 
of these symbols* The symbols on pa ere eighteen are used a^ain on pages 
nineteen and twenty. This do^s not matter, since, to the computer, the 
symbol S on parre eighteen (which means JT'imiO) is entirely differentiated 
from the synibol S on page nineteen (which means ATOMIC SCIENTrST). 

To reiterate, assii^ned to every biographical reference work within 
the data base are three fixed fields or categories; one field for 
miscellaneous descriptors, as. found on page eighteen, another field for 
occupation descriptors, as found on pages nineteen and twenty, and a 
third field for time period or dates. To the computer, each of these 
fields is separate and distinct, and it does not matter if the same 
symbols are used twice, as long as they are in different fields. It 
Tjould matterj however, if the symbol S, for example, were used twice 
in the occupation categories* %re was said regarding these fields in 
the discussion of programming techniques and file organization* 

The descriptors on pages eighteen, nineteen, and twenty not onljr 
serve to illustrate the author's classification scheme but also serve 
as an authority list or subject headings list. This is vital, since 
the descriptors, as well as their symbols, must be applied or assigned 
with equal consistency and accuracy over the entire biographical 
reference collection* This is basic to all information retrieval 
systems. Both the ERIC Thesaurus and the library of Congress Sub.lect 
Heading List require the same constraints. 

With the classification scheme established and refined somewhat^ 
the next step was to index all the biographical reference works in the 
General Reference Department of the lee library. All of the 170 works 
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vere individually examined to determine their contents. They were then 
indexed in terms of the three categories previously explained* 

Appendix A is a partial, list of some of the biographical reference 
vorks in the data base* Before each of the titles appears a series of 
symbols such as HDF or These symbols ^ as explained^ mre used to 

signify the descriptors on pages eifrhteen, nineteen and twenty* In 
Appendix A, all of the titles were indented evenly. There was a good 
reason for this* Tliere \mre, as mentioned^ three categories in tdiich all 
vorks were Sjidexed. Kach of the three categories had a fixed field 
format; that is^ each category was allotted so max^ spaces within the 
fixed fields. The first fixed field (miscellaneous) was allotted 
five spaces from the left hand margin or the beginning of the record* 
The second fixed field (ooeupation) was allotted seven spaces and followed 
directly after the first fixed field. The third fixed field (tine 
period or dates) was allotted eight spaces and immediately followed the 
second fixed field* The following example will help to clarify this 
arrangement: 

T) /» 177lil971 BI0C5RAPHTCAL DIRECTORY OP TIIE AMERICAN 
CPNCEESS 1771-1971 Ref 923.2 UN332 

The D , followed by four spaces^ is the first fixed field and 

comprises the miscellaneous cateRory. Only one .of the five spaces 

is filled with symbols, namely the D. The /» followed by five 

spaces is the second fixed field and comprj^ses the occupation category. 

fnly two of the seven spaces are filled with symbols, namely the / 

and the » . The 177la971 is the third fixed field and comprises the 

time period category. All eifht of the spaces allotted to this category 

are filled and always will be filled, even if the same date appears 

twicej such as 19231923* Since a fixed field format was used, it is 

* 
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unnecessary to have spaces beti^een the r1ates> or for that matter 
betveen thf? three fixed fields* The reference mrk, as illustrated 
above y is indexed to mean: ,a bio^.'jphie&l trork containjnfr bionraohies 
of statesmen livlnf: Trithin the TTnited States between 177lj and 1971* 

Aft^r the third fixed field (date cateisrory) there appears such 
information as the title > the dates the work covers (repeated again), 
the reference call nu2hber> and if useful^ explanatozy notations* There 
are 9$ spaces allotted from the date cate^ioxy to the end of the record 
for such information* The notations (per) maan that the work contains 
portraits of Its biographees* All letters were capitalized in the data 
base^ including cutter letters^ since most computer processing deals 
exclusively- vrith upper case letters* 

The following example illustrates another important factor about 

the way in which the works were indexsdt- 

LP ♦1Z/#U9$51965 WHO'S WHO IN JEWRY 19^5-1965 R 922*96 

VJ62S 

In the first fixed f ieldji the letters L and P appear^ which signify 
the descriptors RELIGION AND JUDAISM. In the second fixed field> the 
symbols appear, which mean respectively, ALL OCCUPATIONS, 

OTCATORS, MILITARY SCIENCE, SOCIAL SCIETJCE, STUTESI-IEN, THEOLOGIAN, 
and ARMY* The work was indexed by both the general and "specific" 
(meaning inteimediately specific) descriptors, so that a patron would 
be assured of retrieving the work whether his approach vas general, 
such as RELIGION, or a little more specific such as JUDAISM* Whenever 
a work was indexed by a sub-category such as JU JAISM, ARMY, and 
STAI^BSMEN it was also indexed by its more general heading RELIGION, 
MILITARY SCIENCE, and SOCK^L SCIENTIST* Thus, the work was assured 
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of being rotrieved whether the patron searched under general or more 

specific tenns* 

The double indexing also allows special bibliographies to be 

printed out both for specific as mil as general areas*, above 

work, !«Jho's ^*'!ho in l^forld JeT-ay., could be retrieved by inputing the 

following combinations of s.'iiiiboXss 

L+ or LL or LZ or l/ .or L» or X# or LI or P+ or EL or PZ P 
or P/ or P* or or PI. 

The fact that each work was indexed under not Just one descriptor Srm 

each fields but rather » under laany descriptors from each fixed fields 

makes the system highly practical* 

The author hinself indexed the entire collection of 170 works » 
Sach work was indexed as specifically as possible* but whenever a 
work was indexed h^r specific categories it was also indexed by its 
more general heading* 

Now that the iforks were indexed and checked for accuracy and 
consistency^ the information was stored onto disk through the computer 
terminal* !:ach of the descriptor symbols as well as the bibliographic 
infoxmation vas individually typed into the data file designated as 
"Books*fil»** Corrections were made using the TECO (text editor 
correction) rotitine* It v:as important to maintain the exact forma t» 
as described earlier » as the works were inputed* It took the author 
about seventeen hours to input the data^ make corrections^ and establish 
the data base* This data base was pairtiaUy reproduced as Appendix A 
of this paper* 

'She computer program was the last remaining essential to the 
project* It was written under the direction of Ifr* Bruce Smithy Itr* Don 
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Ifovak> and Kr* Larry Oibsoni all of whom are computer science graduates, 
or students* The program's meaniner» limitations and capabilities were 
fully explained in Appendix. B* In essence^ the program instructs the 
computer how to match the patron's profile with the index terms assigned 
to the biographical reference works* The program was witten using 
the COBOL lanmafie, Moh %\^s e.special^y able to deal with long strinn^ 
of characters, as opposed to nxanbers on tho other hand* 

This program was quite complex sSnce the nature of the task \ms 
complex* ^e machine must be- told in precise detail every step of the 
search rortine. It will not do anything it has not been instructed to 
do* However, the computer has tremendous memory, speed, and accuracy 
capabilities* It was able to ccmpare the search-key with each of the 
170 works and make a match within milliseoonds. The user, sitting at the 
tezninal, usually spends only seconds before the cospiter responds and 
displays works retrieved* 

Another item of importance was the search-key; which is composed 

of s^bols that are typed into the computer just before the execution 

of the pxogz'am* It represents the patron's question in terms of the 

unique descriptor classification scheme* It was essential for the user 

to convert his information need into a set of terms t^hich may be used 

as a search-key* Ihis vas done by using the descriptors listed on pages 

eighteen, nineteen and twenty* The following have been converted to 

s3rrabolio language search-keys; 

I would like a biography of Hank Aaron, could you help me find one? 
This was converted to, SE197li (Negro Athletics in 1971*) 
Or TE197li (U. S . Athletics in 19714) 
- Or DE (U* S» Athletics) 

I would like a biography of Sugene O'Neill, the famous playwright* 
This was converted to, W19$$ (U*S» Playwright in 1955) 
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It was also converted to (U»6» Author In 1$$$) 

Or DF (U.S. Author) 

I vould like a biography of James Chipman, a field Zoologist^ who 

also teaches at the University of Iowa* 

This was converted to, D8197U (tl.S. Zoolopr'sts in 197lj) 

Or D6197I1 (U*S. Physical or Biological Scientist in 197k) 

Or nmsih (U. S. P-ofesaor in 1971) 

Or 08l97lj (Midwestern Zoolof!:ists in 1971) 

Or O6197I4 (Midwestern Physical or Biological Scientist in 1971) 
In each of the above examples, the user went from the specific to 
the more general search-key to ensure retrieval of all works that are 
relevant to the question. The user may or may not search with a date, 
which ilso facilitates retrieval of all possible works. Since a match 
must occur on only two categories, namely miscellaneous and occupation, 
and not the third category of time period, very few works are not 
retrieved that are relevant to the patron^s question. However the user 
must determine in what categories the patron<s biographee falls* Hank 
Aaron, for example, is a famous Negro baseball player; however, not all 
biographees are as well known* Sometimes, the librarian must question 
the patron about t^ich miscellaneous, occupational and time period 
eat'egories fit the biographee* 

It was important to convert the question into the proper sjmibols, 
since the works themselves were indexed in texns of those symbols* Ihe 
yhole retrieval system was based on the matching of the search-key 
symbols with the s.vmbols that indexed the biographical reference worits 
within the data base. Moreover, for a "match" to have oocured, and for 
works to be retrieved, all elements must be matched* This meant that if 
the search-key appeared as, ISElSlh (U.S. Athletics In 1971*), all three 
e^^ments haiat be matched before any work can be retrieved* In other 
words, the D and E and ISlh must all be matched! This is similar to 
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the "andinn*' of words together in other retrieval systems. 

Appendix C illustrated several searches actually made within the 
system and demonstrated its versatility* These searches > in Appendix C 
illustrate different combinations of descriptors vHnlch yield different 
results* T'Jhen the date is left out of the seaxuh-key> the nuiriber of 
reference viorks retrieved increases* Several sub.iect bibliographies 
produced by the system apueared near the end of Appendix C* This in 
itself provides a valuable service to students and teachers .ilike* For 
more information on the sesrch-kay, the f'etails of the computer program, 
as well as the ability of the vner to se«trch' both with or without a 
date. Appendix 6 was written*- 
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CK/IPTER IV 

SmikK[ AND CONCLUSIONS 

The problem^ briefly restated was to construct an automated 
retrieval system for biographical reference works* This retrieval system^ 
although limited in scope, was to serve as a prototype of i\iture retrieval 
systems and to illustrate the capabilities and potentialities of 
computerized reference retrieval systems* 

Such a prototype system was indeed successfully designed and 
in^lemented. Ttie system was based on the same methodology used by Veil 
and Mereditli in their systems, thereby giving credence to the statement 
that indeed there are some universal or at least general principles and 
procedures to follow in building an automated system* However, it was 
apparent, during the course of the project, that much research and 
further study needed to be undertaken in the area of reference automatics 
before such a eystem can be ideally used at the patron service desk* 
The main advantages of computerized systems over the conventional system 
are the following { 

(1) The memoxy, speed, and a&curacy capabilities of the cospiter 
can be utilized to their full extent* 

(2) With the availability of leasing computer tijoe f2>om service 
bureaus and with access to time sharing computer :^stems, the usage of 
heretofore expensive equipment is now becoming cost effective* 

(3) It is very easy, with the capabilities of the computer, to 
combine various descriptors which are both general and specific, whereas 
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the conventional reference works, such a? Winchell and Slocuia, can only . 
index biographical works under one descriptor at a time* Ihe computer 
is easily able to combine any number of terms and. search for a match^. 

(k) Special bibliographies on various .vubjocts are easily 
produced with great speed and accuracy using the computer. This in itself 
is valuable to teachers as well as students* 

($) Since reference works are already on disk in machine-readabla 
form, for the purpose of reference retrieval, it is also possible to use 
the data base to help select new materials. The librarian, with the aid 
of the computer, can readily spot deficiencies within the present 
collection* 

(6) The ultijnate and most important usefulness of reference 
automated retrieval systema is to the patron* Hopefully, in the not* 
too«distant ftiture, entire reference collections can be in machine— 
readable form* Only time and more research stand in the way of transfozn- 
ing the world of reference work and libraries into more eff iclentt 
speedy, and even economical data and information retrieval systems*- 

With the foregoing elaboration of the advantages of retrieval 
systems, it is appropriate to enumerate also the disadvantages, limitations^ 
and problems associated with developing such systemsr 

(1) Ary automated reference retrieval system, no matter how con5>le3C 
■ and elaborate the hardware and even the software, is only as good as thf» 
classification scheme that is used to index the works. The problems 
associated with "surrogates'* and their respective ambiguity applies here* 
Tha whole notion of what is relevant and what is necessary to represent 
through surrogates appears as an ever puzzling problem. What one person* 
such as the indexer of reference works, may think is relevant, may be 
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completely different ifrom what the patron believes is relevant* In 
other words, the computer and the software can only judge the relevance 
of a reference work based upon the index classification scheme, which 
may itself be relevant to one perscu but inaccurate and irrelevant to 
another* 

(2) Associated with the first disadvantage, is the problem of 
applying the index classification scheme to the collection of reference 
works* Even indexing the collection by one person, as was the case with 
this pro;ject, does not ensure that the descriptors have been applied 
consistently. On the contrary, even when one person indexes the works, 
much variation occurs* 

(3) ISie oaleats of an ^intermediary** (be it a professional 
reference librar^n, or a trained library assistant) are necessazy to 
question the patron, deter/nine his needs, formulate a search-key profile, 
and input it into the computer^ 

(k) Although the system vas only intended to be a prototype of 
future retrieval systems, much can be learned through user studies* At 
present, no automated retrieval system, ii:.cluding the author's, has 
been very extensively used at the reference detilr level* Certainly, this 
is a major milestone to be accomplished in the future. 

($) At present, the author's system was structured that the 
matching routine was accomplished using a sequential search* Thib- means 
that the computer has to compare the search-key with all of the works 
in the data base one at !i time, starting from the beginning and going to 
the end of the file* Here experimentation is needed to determine the 
optintum file organization for increasingly larger and larger data bases, 
since sequential searching routines can be both inefficient and expensive 
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Mhen dealing ^ith large data bases* , 

(6) As mezitioned earlier in this paper« Weil'Sj Meredith's and thd 
author's systems^ were not able to rank the reference sources in order 
of iniportance* Ideally, those Korks most likely to give the patron & 
correct answer would appear first > while less likely sources would appear 
later. 

(7) For automated systems to become completely realistic, they 
must be able to deal with more sophisticated, philosophical questions and 
not only simple, data retrieval, type questions* 

It is recommended that additional research pro;iects be devoted to 
experimentation with reference retrieval systems, and especially to the 
building of classification schemes, which ultimately limit the capabilities 
and are the basis of any retrieval system* If constant research is 
applied, the problems and loop holes of automated reference retrieval 
systems can be overcome* Evantually such systems will provide a realistic 
answer to libraries' - rapidly expanding problem of both bibliographic 
control as well as accessibility of information^ 
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The real core of the program is conjoined in lines 350 -Uiroagh 
800* Lines prior to 350 serve to establish variables and set formats* 
For example » the data base ''books •fil" is established, as are the word 
and letter variables, "end-word, *» "search-key," "found-it," "place-key," 
"occupation-key," "date," "I," and "J," Tha program allows the 
"place field" (aforementioned as miscellaneous field) to occupy five 
spaces, the "occupation field" to occupy seven spaces, and the "date field" 
to occupy eight spaces, (four spaces for the beginning date and four 
spaces for the ending date)* The program also allows the title and 
miscellazieous infozTnation to occupy 95 spaces* 

The program is written so that the computer prints or displays on 
the screen certain phrases that are helpful to the execution of the 
program. For example, the computer displays, "l^ype in search-key in the 
following format, ^Jhere P=>Place, 0«OQcupation, DDJDD^Year*** This helps 
the user to know in what order he is to type in his search profile* 

Lines 370 through 780 are composed of a series of sL^le loops which 
compare each symbol in the patron profile or search-key with each symbol 
in the data base* For example, suppose the following search-key were . 
typed into the computer for a searchc 

M-^l^OO 

(Incidentally, this means a biography of a Mormon of any occupation who 
lived in 1900*) Only two symbols and one year can be submitted for search • 
at one time* The user could not, for Sample, type in M-t-11900, but he 
could make as many repeated searches, one after the other, as he desired^ 
using both "specific" as well as general symbols* The computer will take 
the first symbol of the search-key, which in this case is the letter H, 
and compare that letter with all of the symbols located in the first five 
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spaces or the nisceilaneous fixed field ef each of the 170 mrks. The 
computer will determine if the letter M matches with any other N« If . 
there is a natch^ those works are tag^red and set aside* 

The machine then takes the next symbol of the search-key, which in 
this case is + and compares that symbol ;d,th the SRven symbols or spaces 
in the occupation fixed field, which Smrredistely follows the five spaces 
of the miscellaneous field. However, the computer will only attempt to 
match the + xd.th those works that x^re^e matched with the M» In this way, 
the 170 works are narrowed down each time a match is made with each 
symbol. If there is a match w5th the symbol +, the computer then tags 
or identifies those iforks and sets them aside* 

The date within the search-key is taken next* In this case the date 
is 1900. The computer attempts to match this date with those works that 
ha\'e been set aside and have matched both -with the M and the •t'* The 
machine takes the two dates, within the date field, and simply says, 
*'Is the search-key date (which is l^OO) more than the begizining date and 
less than the ending date?** If so, then a match has taken place* 

Those works that have a match on all three fields, are tagged and set 
aside in a variable called **title-table*" The program then instructs 
the computer to print out "title-table,'^ which is a list of those worics, 
taken from the 170 vnrks, that matched on all three areas of the search- 
key. Any other work that matched on two areas or less is not retrieved. 

It i^s mentioned earlier that one apparent disadvantage of the 
indexing \jas that a substantial amount of works were indexed in terms of 
only one year, such as 1923 to 1923* If the patron does not type in the 
precise date within his search-key then a match will not occur « Thus, 
mai^sr works could not be z^tricved singly because the year td.thin the 

48 



1*3 

soaroh-key did not happen to match the years indexed in the reference 
works. This is certainly a disadvantage^ however, the proeram is written 
so that a tiser may or may not make a search with a date* At the beginning 
of the proj»ram the compnter is instructed to type out, '»search for date?* 
If the user types 3n Y, then a date nust be entered in the search-key; 
however, if the user types in W, then a date must not be entered in the 
search-key. This is a . significant advantape in the retrieval inile. 

Initially, a user should type in a date, l^oreover, he should also be 
as specific as possible within the classification scheme and the descriptors* 
Tf there is a "no match," or if there are only a few matches, then the 
user should make another run and should be more general in his search- 
key, and if desired, may search without a date* This will increase the 
number of titles retrieved* Even though more titles are retrieved without 
a date, it is easy for the user to separate the relevant from the 
irrelevant since the dates the wrk covers are. displayed with the title. 

The follovTincr example will help to illustrate what happens when 
th^ user 5?earches specifically, then, more generally, and finally without 
a date* The search-key T5l;190d was typed into the computer, which means 
a biography of a United States physican living in 1900. No references 
were found mth that search-key so it vas changed and resubmitted as 
DL192O* Still no match occured so the user omitted the date altorjether, 
and entered Tih* The following works ^rere retrieved using that search- 
key? 

Biorranhical Directory of Para-psychologists * 
^^p'g T'Jho in Medicine * 

Biofrraphical Dictionary of Eminent American Physicians * 
Still desiring more works, the user altered his search-key to 1)3, which 
means a biography of United States men of medicine* This made the search* 
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key even more general. The following -works were retrieved using the 

more general search-ksyt 

Bio pranhical Directory of Para-psychologiste # 
T?^o«s Mio in Medicine 

HioyraTjhical Dict-^'onary of Eminent American Physicians ^ 
Qijtgtandj.nn- Younp- Vfomen of America (includes Nurses) 
T'Hio'fi Ir.T?ortant in I^dicine 
American Medical Biopyaphy 

Biop-ranhies in the Nei-r York Academy of Medicine 
The ability of the user to be both specific and general^ as veil as 
to search both with or without a date^ is extremely valuable* 

A final aspect of the program needs explanation* Horr will the 
user know if there is no match on the search-key? The program instructs 
the machine to print out the phrase, "Unable to locate any references 
with that key; if you desire another search, type "KOH."B," otherwise 
type "END." If the patron types "END" the program will not run again; 
however, if he types "MORE," the program will repeat itself. This 
allovrs the patron to execute, very quickly and as many times as desired^ 
the search routine. Depending upon the con^lexity of the search, the 
user usually will only spend a matter of seconds on the terminal to 
arrive at a satisfactory answer* 
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APPENDIX C 
SEARCH EmiBJSS 



Chart no» 2 
Search Examples 

Search-keys 

D91970 (U. S. Chemist in 1970) 
Works Retrievedt 

American Men of Science 1906-I97I R 925 Am35 (Social, Behavioral^ 
Physical and Biological Sciences) 

Search-key t 

D9 (U. S. Chemist) 
Works Retrievedt 

American Men of Science 1906-1971 R 921; Am35 (Social, Behavioral, 
Physical, and Biological Sciences) 

Chemical Who's Vflio 1928-1955 R 925.1; Cl;2 
Search-keys 

D6197O (U. S. Physical and Biological Scientists in 1970) 
Works Retrieved; 

American Men of Science 1906-1971 R 925 Ara35 (Social, Behavioral, 
Physical and Biological Sciences) 

Dictionary of Scientific Biography 1600-1970 R 925 D36I 
American Entomologists 1750-1971 R 925.9 M296a (por) 
Leaders of American Conservation 1900-1971 R 926.3l;9 OjSUL 

Search-keys 

X-t>1965 (European-Americans of all occupations in 1965) 
Works Retrieveds 

Italian American Who's Who 1935-1967 R 920.073 Itl 
Who's Who in Polish America 1850-1970 R 920.073 W62 

» 
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Chart no* 2 
Search Examples 

Search-key* 

3- (U* S. Students 0 to 16 years old) 
Works Retrieved} 

Merits ;-Jho«s Who Among Americ an HiTh School Students 1966-196? 

Search-keys 

C+1966 (U. S, Women of all occupations in 1966) 
Works Retrieved: 

Who's Vflio of American Women 1956-197li R 920.0? W62h 
Outstanding Young Women of America 1966-1973 R 920,?062 Ou6 

Search-key: 

M+1898 (Mormons of all occupations in 1898) 

Works Retrieved^ 

Portrait. Genealogical and Biographical Record of the State 
of Utah 1890-1902 R 920,0?32 N21p 

LPS Biographical Encyclopedia 1830-1930 R M2?0 Jl6lh (por) 
Pioneers and Prominent Men of Utah l81i?-1910 R M2?0 Bh$lh 

Search-keys 

DF19?0 (U* S, Authors in 19?0) 

Works Retrieved? 

International Celebrity Register 1900-1973 R 920.073 In8 

Junior Book of Authors 1900-1972 R 929.08 K96j (por) 

Authors and Writers Who»3 Who of English Speaking World 
1934-1971 R 928.2 Au91 
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Chart no* 2 
Search Examples 

Seareh*keyt 

DF ( U* S. Authors) 

Works Retrievedt 

International Celebrity Register 1900-1973 R 920.073 InS 
A Biocrraphieal Corrmanion to the Arts. 1700-1965 R 927 At63 
Dictionary of American Authors 1700-1905 R 928 Adl9 
The American Literary Yearbook l8$0-1919 R 928 Ani35 

The Biographical Dictionary of Contempo rv Poets 1850-1938 
R 928 B52 

American Authors 1600-1900 R 928 K96am 

Authors of Books for Young People. 1850-1967 R 92& W2l5a 

Who* 3 Who Among North American Authors 1921-1939 R 928 W62ii 

Allibones Dictionary of Authors 16OO-I89I R 928»08 Al58 

Conte mporary Authors 1900-1962 R 928,08 0767 

Junior Book of Authors 1900-1972 R 929.08 K96t (per) 

20th Century Writing 1900-1969 R 928.08 R39Ut 

Autiiors_and.Wr iters Who's Who of the English Speaking World 
i93U-197i"R 92872 AuSl 
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Chart no* 2 
Search Examples 

Search-keys 

H+ (Western United States persons of all occupations) 

Works Eetrieveds 

National Register of Prominent Americans and International 
Notables 1970-1971 ft 920,075 N811 

Personalities of the West and Midwest 1962-1972 R 920*079 PU32 
Who's Vflio in the West. 19ii9"1973 R 920.078 W62m 
Who«s Who in the Rockies 1926-1933 R 920.0783 L233w (por) 
Who's vmo in Colora do. 1958-1968 R 920,0788 W62 

Capitol's Who's Who in Denver and Colorado 19li0-19li2 R 920.0788 W62c 
Vflio's Who in New Mexico, 1937-1937 R 920.0739 W63 . 
WhQts tflio on the Pacific Coas t. 1913-19li9 R 920.079 W62 

P ortrait. Gene alog ical an d Biogr aphical Record of the State _of 
Utah. . 1890-1902 R 920.0792 N21f 

Utah's Distin^ished Personalities 1932-1933 R 920.0792 SiU7a 

LPS Biographical Encyclopedi a. 1930-1950 R M270 Jl^lb (per) 

Pioneers and Prominent Men of Uta h 1817-1910 R M270 E877 (por) 

Press Re ference Library 1850-1913 ft 920.0791* P926w (per) 
TIncIuaes California figures) 

Who's W ho in California 1955-1973 ft 920.079li W62 (por) 

Who's Who in T.os An geles County 1928-1929 R 920.079li W621 (por) 

Who's Vflio in the Pacific Southwest 1912-1913 R 920.O79U V62p 
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